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Abstract 

We prove a special case of Erdos' unit distance problem using a corol- 
lary of the subspace theorem bounding the number of solutions of linear 
equations from a multiplicative group. We restrict our attention to unit 
distances coming from a multiplicative group of rank r not too large. 
Specifically, given e > and n points in the plane, we construct the unit 
distance graph from these points and distances and use the corollary above 
to bound certain paths of length k in the graph giving at most n 1+e unit 
distances from the group above. We require that the rank r < clogn for 
some c > depending on e. This extends a result of Jozsef Solymosi, 
Frank de Zeeuw and the author where we only considered unit distances 
that are roots of unity. Lastly we show that the lower bound configuration 
for the unit distance problem of Erdos consists of unit distances from a 
multiplicative subgroup of the form above. 

1 Introduction 

In 1946 Erdos asked for the maximal possible number, u(n), of unit distances 
among n points in the plane. He gave the lower bound u(n) > n 1 + c / 1 °g 1 °g™ ) 
using a y/nx ^/n grid and conjectured that this was the true magnitude [5]. The 
best known upper bound is u{n) < cn 4 / 3 , first proved by Spencer, Szemeredi 
and Trotter in 1984 16 . This bound has many proofs, the simplest of which 
was the proof by Szekely |17j . using the crossing inequality for graphs. As a 
general reference for work done on the unit distance problem see [3] . 

In [15] Jozsef Solymosi, Frank de Zeeuw and the author showed that the 
number of unit distances between n points in the plane with angle to the x-axis 
a rational multiple of it is at most n 1+c ^ logn . Such unit distances correspond to 
roots of unity. The bound was found by using a result of H.B. Mann bounding 
sums of roots of unity [TU]. This work is an extension of that result to unit 
distances from a group of finite rank — roots of unity correspond to rank 0. The 
proof follows in almost the same way except we use the subspace theorem instead 
of Mann's result. 

Our main tool is a corollary of the subspace theorem. The subspace theorem 
was first proved by W.M. Schmidt in 1972 [TJ]. This theorem essentially says 
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that solutions of linear equations, in a multiplicative subgroup of a field of finite 
rank, come from a finite number of linear subspaces. A number of improvements, 
including quantitative versions, of this result have been made. The corollary we 
use bounds the number of such solutions depending on the rank and dimension. 
This corollary was due originally to Evertse, Schlickewei and Schmidt [7]. We 
will use an improvement of this result of Amoroso and Viada [I] . The progression 
of these results is given in [7j [6] . We will use the bound from 1 which is, to our 
knowledge, the best known bound for the corollary. The subspace theorem is a 
very powerful result with a wealth of applications in number theory. For some 
examples see [2]. 

Consider two points p,g £ I 2 with unit distance. Considering the vector 
between these two points we get the complex number z = z(p,q) = pq[ with 
\z\ — 1. We will restrict our attention to unit distances with z coming from 
a multiplicative subgroup of C* (the multiplicative group of nonzero complex 
numbers) of finite rank. A subgroup T C C* has rank r if there exists a finitely 
generated subgroup To C T with r generators such that for every x G T there 
exists an integer k > such that x G IV 

Suppose r is a subgroup of C* of finite rank r and ai, 02, . . . , Oj, 6 C*. A 
solution of the equation a±Zi + a^z-i + • • • + a-k z k = 1 is called nondegenerate 
if no subsum of the left hand side vanishes. That is a j z j ^ for every 

nontrivial J C {1,2,..., k}. We will consider the number A(k, r) of nondegen- 
erate solutions of this equation with Z{ <G T. We now give the corollary of the 
subspace theorem that we need. 

Theorem 1. Suppose aj., <i2> • • • » a>k G C* and V has finite rank r. Then the 
number of nondegenerate solutions of the eguation 

aiZi + a 2 z 2 H Va k z k = l (1) 

with zi € r is at most 

A{k.r) < (8k) 4k ^ k+kr+1 \ 

Theorem [T] was proved over an arbitrary algebraically closed field K of char- 
acteristic but we only require it over C. 

We will use this to prove the following result. 

Theorem 2. Let e > 0. Then there exist no = na(s) a positive integer and 
c = c(e) > such that given n > no points in the plane, the number of unit 
distances with z coming from a subgroup T C C* with rank r < c log n is at most 

We will prove this theorem in the next section and in Section |3] we will show 
that the lower bound of Erdos satisfies the hypotheses of this theorem. 

2 Proof of the main result 

The proof of Theorem[2]is quite similar to the proof of Theorem 2.2 in [15] . The 
main difference is that we use the subspace theorem instead of Mann's result to 
get an upper bound for paths in the unit distance graph. 
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Suppose G — G(V, E) is a graph on v(G) = n vertices and e(G) = cn 1+a 
edges. We denote the minimum degree in G by S(G). The following lemma 
shows that we can remove low degree vertices from our graph without greatly 
affecting the number of edges. 

Lemma 3. Let G be as above. Then G contains a subgraph H with e(H) = 
(c/2)n 1+a edges such that 6(H) > (c/2)n a . 

Proof. We remove vertices from G of degree less than (c/2)n a . Then, the re- 
sulting subgraph H has 5(H) > (c/2)n a and we removed fewer than (c/2)n 1+a 
edges so H contains more than (c/2)n 1+a edges. □ 

Note that the subgraph H constructed above contains at least v(H) = 
A /cn 1 / 2+Q / 2 vertices. 

Suppose we are given a path on k edges Pk = popi . . -Pk- We call this path 
irredundant if 

X] PiPi+l ^ 
iei 

for every ^ I C {0, 1, . . . , k - 1}. 

Proof of Theorem fJl Let G be the graph with the n points in the plane as 
vertices and the unit distances with z coming from T as edges. Suppose there 
are n 1+£ such distances. Then e(G) = n 1+£ . We will show that we can take 
e as small as we like. We will count the number of irredundant paths Pk in 
G, for a fixed k that we will choose later. By Lemma [3] we can assume that 
e(G) > (l/2)n 1+e ,v(G) > n 1 /^/ 2 and 6(G) > (l/2)n e . 

The number of irredundant paths Pk starting at any vertex v is at least 

fc— 1 ke 

N>l[(S(G)-2 e + l)>^. 

e=o 

The first inequality is true since if we have constructed a subpath Pi of Pk, 
then at most 2 l — 1 of the at least 6(G) possible continuations are forbid- 
den. In the second inequality we have assumed that 2 k < (l/2)n £ , which 
is true as long as fc < e log n / log 2 — 1 (we will show that this holds at the 
end of the proof). Thus the total number of irredundant paths Pk is at least 
Nn i/2+ s /2j2 > n i/2+(fe+i/2)e^2 2fc + 1 . It follows that there are two vertices v and 
w with at least Nn 1 / 2+e / 2 /n 2 > n ( fc + 1 / 2 )e-3/2 j^k i rre dundant paths P k between 
them. We will call the set of these paths V vw , so that we have 

n (fc+l/2)e-3/2 
\Vvw\ > p ■ 

Given Pk G V V w, Pk = PoPi ■ ■ - Pk, consider the fc-tuple (zi, . . . , Zk) where Zi 
is the complex number in the direction from pi-i to pi, i.e. Zi — z(pi^\ 1 pi) — 
Pi-ip\. Let a = z(v, w). Then z\+Z2+- ■ -+Zk = a. Since the path is irredundant 
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no subsum on the left vanishes. So Pk corresponds to a nondegenerate solution 
of Equation ([T]) with a* = 1/a for i = 1, 2, . . . , k. Thus, by Theorem [TJ 

\V VW \ < ( 8 fc) 4fc4(fc+fer+1) . 

Putting these inequalities together and taking logarithms we get 

((fc + l/2)£-3/2)logn < fclog4 + 4A: 4 (fc + fcr + l)log(8A;) 

< 5rk 5 log k, 

where the last inequality holds for large k. From this we get 

5rfc 5 logfc 3 5r/c 4 logfc 3 

~ (fc+ 1/2) log n 2{k + 1/2) ~ logn Ik' w 

We consider the expression on the right hand side as a function of k. Opti- 
mizing this function we get 

k > exp((l/5)W(5c 2 log n/r) 

for some constant c-i > where W is the positive real- valued function satisfying 
x — W{x)e w ^ x \ This function is called the Lambert W function and was first 
studied by J.H. Lambert in 1758 [5]. The following asymptotic expression is 
due to N.G. de Bruijn g]: 

^) = io g (x)-io g i g(,) + o(i2^2I£). 

v log log X 

We don't require this much accuracy. One can easily check, and we will just use 
the fact, that (1/2) logx < W{x) < log a; for x > e. 
Then we can take 



l \ 1/5 /i \ 1/5 

\ogn\ <k< ,,/logn^ 



for some constants c',c" > 0. 

For any e > there is a constant c > such that if r + 1 < c log n then the 
inequality in ([2]) holds for large n. When counting P^'s we made the assumption 
that k < e log nj log 2 — 1. Checking the above values of k and / we see that 
this holds for large n. 

This completes the proof. □ 

3 Analysis of Erdos' lower bound 

It would be interesting to analyze the possible group structure of unit distances 
from a maximal set of points. We will now show that the lower bound configu- 
ration for the unit distance problem given by Erdos satisfies the hypotheses of 
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Theorem [21 Matousek has given a very in-depth account of Erdos' lower bound 
and we will follow that here [TTj . 

We require the following number theoretic functions: 

TTd,a( X ) = ^2 1j $d,a( X ) = ^2 lo SP> ^d.a( X ) = lo gPi 

p<x P<x p f - <X 

p=a(d) p=a(d) p e =a(d) 

where the first two sums are over primes less than x of the form p = a + kd 
and the last sum is over primes p and positive integers £ such that jr = a + kd 
and p l < x. These are analogues of the prime counting function and Chebyshev 
functions for arithmetic progressions. 

We will use the following results regarding these functions all of which are 
well known in number theory. For details see [8] and [12] . 

Theorem 4 (The Prime Number Theorem for Arithmetic Progressions). Sup- 
pose a and d are positive integers such that (a, d) = 1. Then 

1 „ 

TTd.aH = (1+0(1)) 



f(d) logn 

A simple consequence of this result is that if a and d are positive integers 
such that a < d and (a, d) — 1 then the kth prime of the form pi = a + kid 
satisfies pf. = (1 + o(l))k log k/ip(d). 

Theorem 5. Suppose a and d are positive integers such that (a,d) = 1. Then 

71 

i>d,a{n) = + 

Theorem [5] can be deduced from Theorem 5] by partial summation. 
Theorem 6. Suppose a and d are positive integers such that (a, d) = 1. Then 

$d,a{n) = (l + o(l))Vd,a(n). 

The above two theorems give i?d,a(rc) = (1 + o(l))n/<p(d). 
We will also use the following fact. For details see [T5] . 

Theorem 7. The number of integer solutions, R(m), of x 2 + y 2 = m where 
m = p\P2 ■ ■ - Pr and the pi are distinct primes of the form p t — 4A;, + 1 is 

R(m) = 2 r+2 . 

The lower bound configuration consists of n points in a *J~n x y/n grid. The 
step in the grid is chosen to be 1/ ^/m where m is the product of the first r — 1 
primes of the form 4fc + 1 and r is the largest number with m < n/A. We will 
in fact consider a \fn x ^fn grid with step 1 and then count the distances of 
length y/m. This gives a lower bound to the unit distance problem by scaling 
the point set by 1/^/m. 
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We have &P1P2 ■ ■ -Pr-i < n < Ap\P2...p r . From this the bound r > 
logrt/(31oglogn) is found using the prime number theorem for arithmetic pro- 
gressions. Distances equal to \/m in this configuration correspond to integer 
solutions of x 2 + y 2 = m. In the lower bound, the fact that there are at least 
2 r ~ 1 /16 such distances is used. But an upper bound on the number of such 
distances can also be found. By Theorem [7] there are at most 4.2^ r ~ 1 ' +2 = 2 r+3 
such distances from any point so we have at most 2 r+3 n such distances in total. 
Erdos' construction gives a lower bound for r. If we can find an upper bound 
for r then we are done as will be described below. 

We will briefly explain the reason that m is defined as above as this highlights 
the generators to choose for a multiplicative subgroup of C* containing the unit 
distances of the configuration. A prime p has a unique expression, up to the 
order of the terms, of the form x 2 + y 2 — p with x and y positive integers if and 
only if p = 2 or p = 4fc + 1 for some integer k. The Brahmagupta-Fibonacci 
identity says that the product of two numbers, each expressible as the sum of 
two squares, is itself expressible as the sum of two squares. Specifically 

(a 2 + b 2 )(c 2 + d 2 ) = (ac - bdj 2 + (ad + be) 2 
= (ac + bd) 2 + (ad- be) 2 . 

So as we multiply more primes of the form 4fc + 1 together we get more ex- 
pressions of the resulting number as a sum of two squares. So all solutions of 
x 2 + y 2 = m can be described in terms of the solutions of x 2 + y 2 = Pj . 

More formally, we consider the ring R of points in Z 2 with addition defined 
coordinate- wise, so (a, b) + (c, d) = (a + c, b + d), and multiplication defined as 
follows (a, b) ■ (c, d) = (ac — bd, ad + be). One can check that R is actually a 
ring and is in fact isomorphic to the Gaussian integers Z[z] since the operations 
correspond to complex addition and complex multiplication. But Z[i] is a unique 
factorization domain so R is also a unique factorization domain. We will consider 
the elements of R as distance vectors. 

In our grid we are looking for the distance m = p\ . . .p r -i- By Theorem [71 
the number of pairs (x,y) € Z 2 with x 2 + y 2 = m is R(m) = 2 r+1 . Suppose 
x 2 + y 2 — pj. We consider the point (xj, yj) £ R. The product of these r — 1 
points (x,y) = (x\, yi)(x 2 , 2/2) • • • (xr-i,Vr-i) has magnitude 

\(x,y)\ = \ (xi,yi)\ ■ ■ ■ KXr-UVr-l)] = y/pl ■ ■ -Pr-1 = V™. 

So this product gives a point with length y/m. 

Now, R is a unique factorization domain. That means that the point 
(x,y) — (x\, yx) . . . (x r -i, y, — 1) has unique factorization. Specifically, in any 
other factorization of (x, y) = (x[,y[) . . . (x' r _ 1 ,y' r _ 1 ) there is a bijection of 
the factors such that (xj,yj) = Uj (^yj > v'^ij) ) where Uj is a unit. The units in 
R correspond to the units in Z[z]. In the latter these are 1, — 1, i, — i so in the 
former they are (1, 0), (—1,0), (0, 1) and (0, —1). Two elements (a, b), (c, d) G R 
arc called associates if (a, b) = u(c, d) for some unit u. So in a unique fac- 
torization domain the factorization of an element is unique up to ordering and 
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associates. Now, since the pj's are odd primes we cannot have Xj = ±yj for 
1 < j < t — 1. One can check that {xj,yj) and (xj, —yj) are not associates and 
(xj,yj), (xk,yk) are not associates for j ^ k. So we have two points to choose 
from for eachpj, namely (xj,yj) and (xj,—yj), giving 2 r_1 choices for (x,y). 
None of the factors are associates so these choices for (x, y) are all distinct. If 
we multiply a given (x, y) by a unit then we get four different values. So we 
get 4.2 r ~ 1 = 2 r+1 distinct points (x, y) each with length y/m. So these give all 
possible required distances by Theorem [Jj The units are torsion points of R 
(they have finite multiplicative order in R) so they don't affect the rank. 
Going back to unit distances, if we take the complex numbers 

Zj = m-^^-^ixj + i Vj ), Wj = m-^^-^ixj - i Vj ) 

for 1 < j < r— 1 then these generate the multiplicative group of unit distances in 
the configuration. Thus the unit distances come from a multiplicative subgroup 
of C* of rank at most r — 1. So we just need to bound r from above. 

We do this by looking at the inequality pi . . .p r -i < n/A. Taking logarithms 
we get ^4 i i(p r _i) < log(n/4). By Theorems [5] and |H] we get 

f^<>og(»/4). 

By the remark after Theorem [4] we get 
(r-l)log(r-l) 

Solving for r we get 



< 2A/21og(n/4). 



16 logn 

r < 



log log n 

Thus the unit distances come from a multiplicative subgroup of rank at most 
1 — 1 < 16 log nj log log n — 1. Since 

16 log n 

1 < clogn 

log log n 

for large n this configuration is covered by Theorem O 
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